Smart Paper Notes

Analysis and application of multispectral data for water segmentation using machine learning

Shubham Gupta, Uma D., Ramachandra Hebbar

Category: Computer Vision

2022-12-16

Monitoring water is a complex task due to its dynamic nature, added pollutants, and land build-up. The availability of high-resolution data from Sentinel-2 multispectral products makes implementing remote sensing applications feasible. However, overutilizing or underutilizing the multispectral bands of the product can lead to inferior performance. In this work, we compare the performance of ten of the thirteen bands available in a Sentinel-2 product for water segmentation using eight machine learning algorithms. We find that the shortwave infrared bands (B11 and B12) are the best for segmenting water bodies. B11 achieves an overall accuracy of $71\%$ while B12 achieves $69\%$ across all algorithms on the test site. We also find that the Support Vector Machine (SVM) algorithm is the most favourable for single-band water segmentation. The SVM achieves an overall accuracy of $69\%$ across the tested bands over the given test site. Finally, to demonstrate the effectiveness of choosing the right amount of data, we use only B11 reflectance data to train an artificial neural network, BandNet. Even with a basic architecture, BandNet is comparable to known architectures for semantic and water segmentation, achieving a $92.47$ mIoU on the test site. BandNet requires only a fraction of the time and resources to train and run inference, making it suitable for deployment in web applications that monitor water bodies in localized regions. Our codebase is available at https://github.com/IamShubhamGupto/BandNet.
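
The mIoU figure quoted above is a mean intersection-over-union score. As a minimal sketch of how such a score can be computed for a binary water mask, assuming flat lists of 0/1 labels and an unweighted average over the two classes (the paper's exact evaluation protocol is not stated here, and the function names are illustrative):

```python
def iou(pred, truth, cls):
    """Intersection over union for one class over two flat label lists."""
    inter = sum(1 for p, t in zip(pred, truth) if p == cls and t == cls)
    union = sum(1 for p, t in zip(pred, truth) if p == cls or t == cls)
    return inter / union if union else float("nan")


def mean_iou(pred, truth, classes=(0, 1)):
    """Average the per-class IoU, skipping classes absent from both masks."""
    scores = [iou(pred, truth, c) for c in classes]
    scores = [s for s in scores if s == s]  # NaN != NaN, so this drops NaN
    return sum(scores) / len(scores)


# Toy 2x4 masks flattened row by row: 1 = water, 0 = background.
truth = [1, 1, 0, 0, 1, 0, 0, 0]
pred = [1, 0, 0, 0, 1, 1, 0, 0]
score = mean_iou(pred, truth)
```

On these toy masks the water class scores 2/4 and the background class 4/6, so the mean is 7/12; reported values such as 92.47 are the same quantity expressed as a percentage.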

Adapting Pretrained Text-to-Text Models for Long Text Sequences

Wenhan Xiong, Anchit Gupta, Shubham Toshniwal, Yashar Mehdad, Wen-tau Yih

Category: Natural Language Processing

2022-09-21

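We present an empirical study of adapting an existing pretrained text-to-text model to long-sequence inputs. Through a comprehensive study along three axes of the pretraining pipeline (model architecture, optimization objective, and pretraining corpus), we propose an effective recipe for building long-context models from existing short-context models. Specifically, we replace the full attention in transformers with pooling-augmented blockwise attention and pretrain the model with a masked-span prediction task using spans of varying length. For the pretraining corpus, we find that randomly concatenating short documents from a large open-domain corpus yields better performance than using existing long-document corpora, which are typically limited in domain coverage. With these findings, we build a long-context model that achieves competitive performance on long-text QA tasks and establishes a new state of the art on five long-text summarization datasets, often outperforming previous methods that use larger models.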
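
The corpus construction described above (randomly concatenating short documents into long training sequences) can be sketched roughly as follows. This is an illustration only, not the authors' pipeline: the whitespace tokenization, the `target_len` parameter, and the choice to truncate overflow tokens are all assumptions.

```python
import random


def concat_short_docs(docs, target_len, seed=0):
    """Pack randomly ordered short documents into sequences of target_len tokens.

    `docs` is a list of token lists. Tokens that overflow a full sequence are
    dropped, which is a simplification made for this sketch.
    """
    rng = random.Random(seed)
    order = list(docs)
    rng.shuffle(order)
    sequences, current = [], []
    for doc in order:
        current.extend(doc)
        if len(current) >= target_len:
            sequences.append(current[:target_len])  # truncate the overflow
            current = []
    if current:
        sequences.append(current)  # keep the shorter final remainder
    return sequences


# Ten hypothetical 5-token documents, split on whitespace.
short_docs = [f"doc{i} has a few tokens".split() for i in range(10)]
long_inputs = concat_short_docs(short_docs, target_len=16)
```

Here every fourth document completes a 16-token sequence, and the last two documents form a shorter remainder.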

A Survey on Temporal Graph Representation Learning and Generative Modeling

Shubham Gupta, Srikanta Bedathur

Category: Machine Learning

2022-08-25

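Temporal graphs represent dynamic relationships among entities and arise in many real-life applications, such as social networks, e-commerce, communication, road networks, and biological systems. Their generative modeling and representation learning call for research beyond the work done on static graphs. In this survey, we comprehensively review recently proposed neural methods for time-dependent graph representation learning and generative modeling on temporal graphs. Finally, we identify the weaknesses of existing approaches and discuss the research proposal of our recently published paper, TIGGER [24].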

Image denoising in acoustic field microscopy

Shubham Kumar Gupta, Azeem Ahmad, Prakhar Kumar, Frank Melandso, Anowarul Habib

Category: Computer Vision

2022-08-07

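Scanning acoustic microscopy (SAM) has come into use because microscopic images are widely used in biomedical and materials research. Acoustic imaging is an important, well-established method in nondestructive testing (NDT), biomedical imaging, and structural health monitoring. The imaging is frequently carried out with low-amplitude signals, which can produce images that are noisy and lacking in detail. In this work, we analyze SAM images acquired from low-amplitude signals and apply a block-matching filter to the time-domain signals to obtain a denoised image. We compare the results with conventional filters applied to the time-domain signals, such as the Gaussian filter, median filter, Wiener filter, and total variation filter. The notable results are presented in this article.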
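
Of the conventional baselines listed above, the median filter is the simplest to illustrate. The sketch below applies a sliding-window median to a one-dimensional time-domain signal; the window size and the edge-repeating boundary handling are assumptions, and this is not the block-matching filter the paper proposes.

```python
from statistics import median


def median_filter(signal, window=3):
    """Sliding-window median with edge samples repeated at the boundaries."""
    if window % 2 == 0:
        raise ValueError("window must be odd")
    half = window // 2
    padded = [signal[0]] * half + list(signal) + [signal[-1]] * half
    return [median(padded[i:i + window]) for i in range(len(signal))]


# Hypothetical low-amplitude trace with two impulsive outliers (5.0 and -4.0).
noisy = [0.0, 0.1, 5.0, 0.2, 0.1, -4.0, 0.0]
smoothed = median_filter(noisy)
```

The two outliers are replaced by neighboring values while the rest of the trace is left close to its original level.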

Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso

Categories: Natural Language Processing | Artificial Intelligence | Machine Learning | Machine Learning (Statistics)

2022-06-09

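Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. To inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-future capabilities and limitations of language models. To address this challenge, we introduce the Beyond the Imitation Game benchmark (BIG-bench). BIG-bench currently consists of 204 tasks, contributed by 442 authors across 132 institutions. Task topics are diverse, drawing problems from linguistics, childhood development, math, common-sense reasoning, biology, physics, social bias, software development, and beyond. BIG-bench focuses on tasks that are believed to be beyond the capabilities of current language models. We evaluate the behavior of OpenAI's GPT models, Google-internal dense transformer architectures, and Switch-style sparse transformers on BIG-bench, across model sizes spanning millions to hundreds of billions of parameters. In addition, a team of human expert raters performed all tasks to provide a strong baseline. Findings include: model performance and calibration both improve with scale but are poor in absolute terms (and when compared with rater performance); performance is remarkably similar across model classes, though with benefits from sparsity; tasks that improve gradually and predictably commonly involve a large knowledge or memorization component, whereas tasks that exhibit "breakthrough" behavior at a critical scale often involve multiple steps or components, or brittle metrics; social bias typically increases with scale in settings with ambiguous context, but this can be improved with prompting.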

Differentiable Rule Induction with Learned Relational Features

Remy Kusters, Yusik Kim, Marine Collery, Christian de Sainte Marie, Shubham Gupta

Categories: Machine Learning (Statistics) | Machine Learning

2022-01-17

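Rule-based decision models are attractive because of their interpretability. However, existing rule induction methods often produce long, and consequently less interpretable, rule models. This problem can often be attributed to the lack of an appropriately expressive vocabulary, i.e., the relevant predicates used as literals in the decision model. Most existing rule induction algorithms presume predefined literals, naturally decoupling the definition of the literals from the rule learning phase. In contrast, we propose the Relational Rule Network (R2N), a neural architecture that learns literals representing linear relationships among numerical input features, along with the rules that use them. By coupling literal learning directly with rule learning in an end-to-end differentiable fashion, this approach opens the door to more expressive induced decision models. On benchmark tasks, we show that the learned literals are simple enough to retain interpretability, yet improve prediction accuracy and yield more concise rule sets than state-of-the-art rule induction algorithms.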
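
To make the vocabulary concrete: a literal of the kind described above is a linear relationship among numerical features, and a rule combines literals. The sketch below fixes the weights by hand, whereas R2N learns them end to end; the conjunction-within-a-rule and disjunction-across-rules structure, the feature names, and the thresholds are all illustrative assumptions.

```python
def literal(weights, bias):
    """Return a predicate that is true when weights . x + bias > 0."""
    return lambda x: sum(w * xi for w, xi in zip(weights, x)) + bias > 0


def rule(*literals):
    """A rule fires when all of its literals hold (conjunction)."""
    return lambda x: all(lit(x) for lit in literals)


def rule_set(*rules):
    """The model predicts positive when any rule fires (disjunction)."""
    return lambda x: any(r(x) for r in rules)


# Hypothetical hand-set literals over features x = (income, debt, age).
income_exceeds_twice_debt = literal([1.0, -2.0, 0.0], 0.0)
is_adult = literal([0.0, 0.0, 1.0], -18.0)
model = rule_set(rule(income_exceeds_twice_debt, is_adult))
```

A single relational literal such as "income exceeds twice the debt" would otherwise need many axis-aligned thresholds to approximate, which is one way to read the conciseness claim in the abstract.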

A Differentiable Recipe for Learning Visual Non-Prehensile Planar Manipulation

Bernardo Aceituno, Alberto Rodriguez, Shubham Tulsiani, Abhinav Gupta, Mustafa Mukadam

Categories: Robotics | Artificial Intelligence

2021-11-09

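Specifying tasks with videos is a powerful technique for acquiring novel and general robot skills. However, reasoning over mechanics and dexterous interactions can make it challenging to scale the learning of contact-rich manipulation. In this work, we focus on the problem of visual non-prehensile planar manipulation: given a video of an object in planar motion, find contact-aware robot actions that reproduce the same object motion. We propose a novel architecture, Differentiable Learning for Manipulation (DLM), that combines video-decoding neural models with priors from contact mechanics by leveraging differentiable optimization and finite-difference-based simulation. Through extensive simulated experiments, we investigate the interplay between traditional model-based techniques and modern deep learning approaches. We find that our modular and fully differentiable architecture performs better than learning-only methods on unseen objects and motions. Code is available at https://github.com/baceituno/dlm.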

Optimal and Efficient Dynamic Regret Algorithms for Non-Stationary Dueling Bandits

Shubham Gupta, Aadirupa Saha

Categories: Machine Learning | Artificial Intelligence

2021-11-06

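We study the problem of dynamic regret minimization in $K$-armed dueling bandits under non-stationary, time-varying preferences. This is an online learning setting in which the agent chooses a pair of items in each round and observes only relative binary win-loss feedback for that pair, sampled from the underlying preference matrix of that round. We first study static regret minimization for adversarial preference sequences and design an efficient algorithm with $O(\sqrt{KT})$ high-probability regret. We then use similar algorithmic ideas to propose an efficient and provably optimal algorithm for dynamic regret minimization under two notions of non-stationarity. In particular, we establish $\tilde{O}(\sqrt{SKT})$ and $\tilde{O}(V_T^{1/3} K^{1/3} T^{2/3})$ dynamic regret guarantees, where $S$ is the total number of "effective switches" in the underlying preference relations and $V_T$ is a measure of "continuous-variation" non-stationarity. Despite the practical relevance of non-stationary environments in real-world systems, the complexity of these problems had not been studied prior to this work. We justify the optimality of our algorithms by proving matching lower bounds under both of the above notions of non-stationarity. Finally, we corroborate our results with extensive simulations and compare the efficacy of our algorithms against state-of-the-art baselines.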
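
The feedback model described above can be simulated in a few lines. The sketch below is only the environment, not the paper's algorithm: the preference matrices, the single switch at round 1000, and the fixed choice of the pair (0, 1) are hypothetical values chosen for illustration.

```python
import random


def duel(pref, i, j, rng):
    """Return 1 if item i beats item j, sampled with probability pref[i][j]."""
    return 1 if rng.random() < pref[i][j] else 0


def preference_at(t, switch_round, before, after):
    """Piecewise-constant preferences with a single switch."""
    return before if t < switch_round else after


# Hypothetical 3-item preference matrices with pref[i][j] + pref[j][i] = 1.
before = [[0.5, 0.9, 0.9], [0.1, 0.5, 0.6], [0.1, 0.4, 0.5]]
after = [[0.5, 0.1, 0.1], [0.9, 0.5, 0.6], [0.9, 0.4, 0.5]]

rng = random.Random(0)
wins = [0, 0]  # wins of item 0 over item 1 before and after the switch
for t in range(2000):
    pref = preference_at(t, 1000, before, after)
    wins[t // 1000] += duel(pref, 0, 1, rng)
```

Item 0 wins most duels against item 1 before the switch and loses most afterwards, so a learner that keeps committing to item 0 accumulates dynamic regret in the second phase. Bounds that scale with the number of switches, such as $\tilde{O}(\sqrt{SKT})$, quantify how cheaply an algorithm can track changes of this kind.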

Consistency of Constrained Spectral Clustering under Graph Induced Fair Planted Partitions

Shubham Gupta, Ambedkar Dukkipati

Categories: Machine Learning | Machine Learning (Statistics)

2021-05-08

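Spectral clustering is popular among practitioners and theoreticians alike. While performance guarantees for spectral clustering are well understood, recent studies have focused on enforcing "fairness" in clusters, requiring them to be "balanced" with respect to a categorical sensitive node attribute (e.g., the race distribution in clusters must match the race distribution in the population). In this paper, we consider a setting in which sensitive attributes manifest indirectly in an auxiliary representation graph rather than being directly observed. This graph specifies pairs of nodes that can represent each other with respect to the sensitive attributes, and it is observed in addition to the usual similarity graph. Our goal is to find clusters in the similarity graph while respecting a new individual-level fairness constraint encoded by the representation graph. We develop variants of unnormalized and normalized spectral clustering for this task and analyze their performance under a fair planted partition model induced by the representation graph. This model uses both the cluster membership of the nodes and the structure of the representation graph to generate random similarity graphs. To the best of our knowledge, these are the first consistency results for constrained spectral clustering under an individual-level fairness constraint. Numerical results corroborate our theoretical findings.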

Federated Learning with Client-Exclusive Classes

Jiayun Zhang, Xiyuan Zhang, Xinyang Zhang, Dezhi Hong, Rajesh K. Gupta, Jingbo Shang

Category: Machine Learning

2023-01-01

Existing federated classification algorithms typically assume the local annotations at every client cover the same set of classes. In this paper, we aim to lift such an assumption and focus on a more general yet practical non-IID setting where every client can work on non-identical and even disjoint sets of classes (i.e., client-exclusive classes), and the clients have a common goal which is to build a global classification model to identify the union of these classes. Such heterogeneity in client class sets poses a new challenge: how to ensure different clients are operating in the same latent space so as to avoid the drift after aggregation? We observe that the classes can be described in natural languages (i.e., class names) and these names are typically safe to share with all parties. Thus, we formulate the classification problem as a matching process between data representations and class representations and break the classification model into a data encoder and a label encoder. We leverage the natural-language class names as the common ground to anchor the class representations in the label encoder. In each iteration, the label encoder updates the class representations and regulates the data representations through matching. We further use the updated class representations at each round to annotate data samples for locally-unaware classes according to similarity and distill knowledge to local models. Extensive experiments on four real-world datasets show that the proposed method can outperform various classical and state-of-the-art federated learning methods designed for learning with non-IID data.
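
The matching formulation above can be illustrated with a toy example: classification becomes picking the class representation most similar to the data representation. The sketch assumes cosine similarity and uses hand-written three-dimensional vectors in place of the outputs of the data encoder and label encoder; the class names and all values are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def classify(data_repr, class_reprs):
    """Pick the class whose representation best matches the data representation."""
    return max(class_reprs, key=lambda name: cosine(data_repr, class_reprs[name]))


# Hypothetical embeddings standing in for label-encoder outputs of class names.
class_reprs = {
    "walking": [1.0, 0.0, 0.0],
    "running": [0.0, 1.0, 0.0],
    "cycling": [0.0, 0.0, 1.0],
}
sample = [0.2, 0.9, 0.1]  # stand-in for a data-encoder output
predicted = classify(sample, class_reprs)
```

Because every client scores its samples against the same shared set of class representations, a client can assign similarity-based labels even for classes it has never annotated locally, which is the role the abstract gives to the updated class representations.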
